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Abstract 



In the context of auctions for digital goods, an interesting random sampling auction has been proposed 
by|Gold bcrg, Hartline, and Wright 200 lj. This auction has been analyzed by |Feige, Flaxman, Hartline,| 



and Kleinberg 2005 , who have shown that it is 15-competitive in the worst case - which is substantially 
better than the previously proven constant bounds but still far from the conjectured competitive ratio 
of 4. In this paper, we prove that the aforementioned random sampling auction is indeed 4-competitive 
for a large class of instances where the number of bids above (or equal to) the optimal sale price is at 
least 6. We also show that it is 4.68-competitive for the small class of remaining instances thus leaving 
a negligible gap between the lower and upper bound. We employ a mix of probabilistic techniques and 
dynamic programming to compute these bounds. 



1 Introduction 

In recent years, there has been a considerable amount of work in algorithmic mechanism design. Most of 
this work can be divided into two categories based on their assumption about prior: (i) Bayesian, and (ii) 
prior free. Bayesian mechanism design is based on exploiting the knowledge of the prior to optimize the 
expected performance, whereas prior free mechanism design is aimed at optimizing the worst case perfor- 
mance. Random sampling is perhaps the most popular technique in prior free mechanism design, yet an 
accurate analysis of its performance has proven difficult even in the simplest applications. 

This paper focuses on analyzing the performance of the random sampling auction proposed by Goldberg 



et al. 2001 , known as the "Random Sampling Optimal Price (RSOP)" auction. The basic problem can be 
described as follows. A seller has unlimited supply of a good (e.g., a digital good) [j] which he is going to 
sell to unit demand bidders through the following auction: bids are partitioned into two sets uniformly at 
random; then the optimal (revenue maximizing) sale price is computed for each set, and offered as the sale 
price to the opposite set. The expected revenue of RSOP is then compared against the optimal revenue of 
single price sale of at least two copies. 

Most of our analysis is based on the following approach: we develop a lower bound on the performance 
of RSOP that depends on the level of balancedness of the partitions, but independent of the bid values; we 
then take the expectation of this lower bound over the varying level of balancedness to obtain a general 
lower bound on the performance of RSOP. That is in contrast to the previous work based on showing that 
a certain level of balancedness is met with a reasonable probability, which inevitably requires a tradeoff 
between how strong the balancedness condition is versus how likely it holds. 



*A preliminary version appeared in the 10th ACM conference on Electronic Commerce, 2009. 
tld: rsop.tex 55 2013-03-10 23:35:24Z saeed 
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Related work. 



The random sampling optimal price (RSOP) auction has been proposed by Goldberg et al. 

The revenue of RSOP has been 



2001 , but the problems was first studied by Goldberg and Hartline 2001 



2003 , and Balcan et al. 



2005 



shown to be close to optimal for many classes of interesting inputs by Segal 
There has also been a fair amount of work analyzing the competitive ratio of RSOP. |Gold bcrg and Hartline 
[2001] showed that RSOP obtains a constant fraction of the optimal revenue, and conjectured the constant to 
be 1/4; note that the conjecture is tight for an instance with 2 bidders with distinct bids. A better analysis 
was proposed by Feige et al. 2005 which proved the constant to be at least 1/15 . 



It is important to prove that RSOP is 4-competitive, because it is a natural and popular mechanism which 



is easily implementable and adaptable to various settings (e.g., double auctions Baliga and Vohra 


2003 , 


online limited-supply auctions Hajiaghayi et al. 


2004 , combinatorial auctions Balcan et al. 


2005 , 


Goldberg 


and Hartline 


2001 , and other setting such as 


Hartline and Roughgarden 


2008]). Indeed the results of this 



paper have been used in analysis of other auctions such as the random sampling based auction of |Devanur 
and Hartline 2009 for limited and online supply. 



Results. The following is a summary of our main results. 

I. Improved lower bounds: We prove that the ratio of the expected revenue of RSOP to its benchmark 
is: 



due to Feige et al. 2005 



at least 1/4.68 (e.g., Theorem 1 and Theorem 2), improving the previous lower-bound of 1/15 



• at least 1/4, if there are at least 6 bids above (or equal to) the sale price. 

• at least 1/3.52, as the number of bids above (or equal to) the sale price approaches infinity. 

Our analysis suggests that the worst case performance of RSOP is attained when there are only two 
bidders with distinct bids. 

II. Upper bound: We show that there exist instances where the expected revenue of RSOP is still less 
than 1/2.65 of its benchmark, even when the number of bids above the optimal sale price approaches 
infinity. 

III. Combinatorial approach: We also present a combinatorial lower bound on the performance of RSOP 
for a special case when each non-zero bid can take one of only two possible values. 



2 Preliminaries 

We consider auctioning a digital good to n bidders whose bids are represented by the vector v — (vi, . .. , v n ) 
which is, without loss of generality, sorted in decreasing order. 

Definition 1 (RSOP). The random sampling optimal price auction partitions the bids into two sets A and 
B uniformly at random computes the optimal sale price of each set, and offers it as the sale price to the 
opposite set. 

Definition 2 (OPT). The optimal revenue from single price sale to at least two bidders is 

OPT = max jv«. (1) 
j>2 1 w 



See Goldberg et al. 2006 for motivation of the definition of OPT and why it requires selling to at least 
two bidders. 



2 I.e., each bid independently goes to one of A or B with probability =. 



2 



Assumptions. We assume A and B contain the indices of the bids (as opposed to the actual value 
of the bids). Without loss of generality we assume there are infinitely many bids, i.e., Vj — for all 
j > n; consequently (A, B) is a partitioning of N. The previous assumption allows us to make our analysis 
independent of n. Also, without loss of generality we assume lgB.| 

Throughout most of our analysis we ignore the revenue of RSOP from bidders in A because in the 
pathological case where V\ is too large (e.g., v± > OPT), the optimal sale price for B is equal to V\ which 
yields no revenue when offered to A. 

Notation. We adopt the convention of using bold letters for vectors, capital roman letters for sets, capi- 
tal italic letters for single-dimensional random variables, capital bold letters for multi-dimensional random 
variables (such as sets or vectors), and capital calligraphy letters for events. 

We will use E[RSOP] to denote the expected revenue of RSOP on an implicit bid vector v, where the 
expectation is taken over all random partitions (A,B); however we sometimes specify an explicit bid vector 
by writing E[RSOP(v)] or OPT(v). 

We use A to denote the index of the optimal sale price which sells to at least two bidders, i.e., 



A € arg maXj-^2 jVj (2) 

For every j e N, we define 

^ = |An{i---j}|, (3) 

_ |Bn{i---j}| _ 

^ |An{i-j}r Si ' w 

Z = min ({Zj} i6N , 1) . (5) 

Note that Sj, Zj and Z are random variables which depend only on how the bids are partitioned, but not 
on the actual value of the bids. 

For every TcN and a, a' G [0, 1], we define the following events: 

^ = {? e a T x 7^ a }' (6) 

= {«' < max ^ < a\ = El \ El (7) 



Figure 1 illustrates an example of £„ and £ , , ,. We omit T if T = N, i.e., £ n — and £ , , , = £ , , , 



Finally, for any random variable X and event £ , we use ~E[X \ £] to denote the expectation of X 
conditioned on event £ normalized by the probability of £, i.e., 

E [X\£ } = E [X | £ } Pr [£ } , (8) 

We will use the following proposition extensively throughout this paper. 

Proposition 1. For any random variable X and any two events £,£' , 

• ifS'C £, then E[X \£\£'] = E[X | £} - E[X | £']; 

• if £n£' = %, thenE[X \£U£'] = 1 E[X \£]+E[X \£'] 

The following lemmas will be useful throughout the rest of this paper. 

Lemma 1. For any T,T' C N and a £ [0,1], the two events £j and £^ are positively correlated, i.e., 
Pr[£j n ££] > Pr[f T] Pr[£l'} (alternatively Pr[£l n £^\ < Pr [£ Pr[£~l']). 

Proof. The claim follows directly from the FKG inequality and can be found in |Appcndix B| □ 



' Otherwise we can swap A and B. 
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Lemma 2. For any a £ (0, 1) and j £ N, 

if a> 0.5, then Pr Is^] > 1 - (r a ) J , w/iere r a = - — -j— -, (9) 

L J 2a a {l — a) 1 a 

if a < 0.5— i/ien Pr £^ < ( r ( a +i/j)Y 1 where r a is the same as above. (10) 

Proof. The claim follows from a direct application of Chcrnoff-Hocffding bound and can be found in |Ap-| 
Ipendix B| □ 

3 The Basic Lower Bound 

In this section we prove that RSOP is indeed 4-competitive for a large class of input instances (i.e., when 
A > 10). In the next section, we improve this result for A < 10 using a more sophisticated analysis, but 
based on the same ideas. The following theorem summarizes the main result of this section. 

Theore m 1. E[RSOP] > \ OPT for all A > 10. Furthermore, E[RSOP] > ^ OPT for all A > 5000. 
\Table l\ lists the actual lower bounds obtained for various choices of X. 



The outline of this section is as follows. First, we present a lower bound on E[RSOP] as a function of A. 
Recall that expectation is taken over all random partitions (A, B) for a fixed set of bids (and thus a fixed A). 
Our proposed lower bound depends only on A and not the actual value of the bids. We present a dynamic 
program for numerically computing the lower bound for any fixed A. By computing the lower bound on 
E[RSOP] for all A £ {11 • • • 5000} we confirm that it is indeed greater than | OPT. We then prove a lower 
bound of OPT on E[RSOP] for all A > 5000. 

The following lemma provides a lower bound on E[RSOP] as a function of A. 

Lemma 3. E[RSOP] >E[^Z] OPT. 

Proof. Let v\ A be the optimal price for A which RSOP offers to bidders in B; observe that S\ A v\ A > SjVj 
for all j £ N. The revenue of RSOP is at least the revenue it obtains from B, therefore 

RSOP > (Aa — S\ A )v\ A because at least Aa — S\ A bids in B are above or equal to v\ A 

= Z\ A S\ A v\ A by definition of Z\ A in Q 

> ZS\v x because Z < Z Xa and S\V\ < S\ A v\ A 

= Z^- OPT because OPT = \v\. 

A 

Consequently, E[RSOP] >E[^Z] OPT which proves the claim. □ 

It is crucial that the lower bound provided by the above lemma only depends on A and not on the exact 
value of the bids. Recall that A depends only on the value of the bids and not on how the bids are partitioned. 

3.1 Small A 



We start by proving the first part of Theorem 1 i.e., that E[RSOP] > \ OPT for all 10 < A < 5000 



Recall that E[RSOP] > E[^Z] OPT by Lemma 3 Ideally, we would like to approximate E[^Z] by 



E[^] E[iT], however ^ and Z are negatively correlated. To work around this obstacle we will decompose 
E[-^-Z] over a set of small and disjoint events such that, conditioned on each such event, Z can be approx- 
imated closely by a constant. The events are defined as follows. We partition the interval [0, 1] to small 
disjoint intervals by picking m points 0.5 < otx < • • • < a m < 1. For each interval ( o^-i, on] w e consider the 



s ■ 

event £^ a , ia .y Recall that £^ a . i a ., is the event that (maxj -j) £ (aj_i,aj] (see Figure 1). Conditioned 

rl-aj l— an- 



on £, a . i Q .j, it is easy to see that Z £ [ ~ ai , ), and therefore we can obtain a good lower bound by 

substituting Z with 1 "^ a ' . Notice that there is no use in picking on from [0,0.5] because for any a £ [0,0.5], 

Pr[£ Q ] = and therefore, for any bounded random variable X, we get E[X | £ a ] — E[X | £ a ] Pr[£ Q ] = 0. 
Also notice that there is no use in considering the event £, ^ because we can only guarantee a trivial lower 



bound of for Z under £, a ^ . 
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a = 3/4 



(a) £ a denotes the event that the points (j,Sj) lie below 
the line y = ax (the gray area) . This figure shows the plot 
of (J, Sj) for an instance of random partitioning in which 
the event £ a with a = § has occurred. 




(b) £( ai _ 1 a A denotes the event that the point 

(j,Sj) with the highest ratio of -4- lies on or 
below the line y = a t x and above the line y = 
oti~\x. This figure shows the plot of (j, Sj) for 
an instance of random partitioning in which the 



event £ 



(c*2,e*3] 



has occurred (indicated by the 



dark gray region). 



Figure 1 



Lemma 4. Given an increasing sequence a.\, . . . ,a m € (0.5, 1), the following inequality holds for any non- 
negative random variable X. 



E [XZ] > J2 



i.=i 



1 1 



v[x\e a 



Assume a m+ i = 1. 

Proof. Let ao = 0. We decompose E[XZ] over the set of disjoint events £, ■., . . . , £r ctm _ 1 a i as follows. 



E 



[XZ] > ^2 E \XZ 



> 



X- 



(ai_i,Qi] 
1 — CV. 



(Qi_l,Q ; ] 

s. 



by law of total expectation 



1 — a- 

because Z > conditioned on 8, 



(a 4 _i,a<] 



X 



by Proposition 1 given that 8^ a ._^ a ^ = 8 a . \ 8 a ._ x 



by rearranging the terms. 



Note that in the last step we have used the fact that E[X | 8 ag ] = (because Pr[£' Q , o ] = 0). 



□ 



The choice of m and arj., . . . , a m in |Lcmma~4| greatly affects the value of the lower bound. Generally 
speaking, increasin g m impro ves the lower bound but at the cost of more computation. 

effectively, we need to be able to compute E[^ | 8 a ] for each cti. However 



In order to use 



Lemma 4 



the events 8 a . are hard to deal with computationally. The next two lemmas show that E[^ | 8 a .] can be 

bounded below and thus approximated by E[-^ | 8a] ] — e where e approaches exponentially fast as a 
function of I. 
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Lemma 5. For any random variable X 6 [0, 1], any a 6 (0.5, 1], and £ € N i/ie following holds: 



E 



> E 



A" 



Pr 



Proof. Observe that £ Q = si 1 " £} \ (fi^ 1 '"^ n Q therefore 



E 


X 


s a 


= E 


X 


£ {1- 


•n' 








> E 


X 


£ {l- 


•n' 








> E 


X 




•n' 



E 
Pr 
-Pr 



{t+l—oo} 



{£+l---oo} 



1 - Pr 



by Proposition 1| 
because X g [0, 1] 



Si 1 "* 



1 - Pr 



£{£+l-oo} 



by Lemma T] 



The following lemma allows us to compute an upper bound on the e of the previous lemma. 
Lemma 6. For any a £ (0.5, 1] and any £, £' £ N such that £ < £' , the following holds: 

t'+V 



Pr 



> 1 



1-Tn 



II (}-(r«Y 



j=t+i 



Proof. 









OO 




Pr 


c{<M-l-oo}~ 


= Pr 


n & 


> Pr 








j=e+i 





n & 

■j=i'+i 



n pr [* } 



by |Lemma 1 



> 



-1 \ * 



{:/} 



n pr N' 



(12) 



□ 



where r a is defined in Q (13) 



> 1 



(r«r 



l-r„ 



n O-w) 



by union bound 
by |Lcmma 2 



j=e+i 



□ 



Observe that in the special case of the above lemma in which 1 = 1', the right hand side of ( 13 1 approaches 
1 exponentially fast as a function of £ which implies that e in ( 12 ) approaches exponentially fast as a function 
of £. Choosing £' > £ only improves the bound. 

The next lemma provides a recurrence relation which can be used to compute the exact value of 
E[^ | Si 1 -^} and Pr^i 1 " n ] in time 0{£ 2 ). 



4 £^ +1 is the complement of 



G 



Lemma 7. For any t 6 N and a G [0, 1], i/ie exact value o/E[ J j L | fi 1 ^] and Pr^ 1 can be computed 

using the following recurrence in which ^ = S^ 1 ^ H {Si = k} is the event that ^ happens and 
St = k. 



E 



E 



Pr 



A 



Pr 

Sx 
A 



{i- 

Q: . A; 



Si 1 ' 




a,k J 



£>l,k<a£ 
£ = l,k = 
otherwise 



Q, fe J 



£> X,k<a£ 
£ = A 
otherwise 



fe=0 



decomposed as two disjoint events S^-i 1} t^-Ai and S^k 1 ^ H«4^, therefore its probability is the sum of the 

' 1 ^ and ^ are independent for any £ and fc and Pr [^] 



Proof. Let ^ denote the event that ^ € A. First consider (j 1 4|) : if £ > 1 and fc < a£, then £ 

'a,k-l 

probabilities of those two event; note that 



{1-1} 



(14) 

(15) 

(16) 

(17) 
can be 



furthermore the base of the recursion is Prff^g] = 1 because by our assumption Ai = (i.e., the highest 
bid is always in B). The same argument implies the correctness of (15) for the case of £ > A. Furthermore, 
sj^L X ^ by its definition implies Sx — k which implies the correctness of ( 15 ) for the case of £ = A. Finally 
(16) and (17) follow trivially from the law of total probability and the law of total expectation. □ 



Proof of Theorem 1 for small A (i.e., 10 < A < 5000J. We show how to numerically compute a lower bound 
on E[RSOP] for any fixed A. Let m = 100 and a, = 0.5 + for each i £ [m]. Observe that 



E [RSOP] > E 



A 



> 



E 

1=1 



OPT 
1 



by Lemma 3 



Cti+l 



E 



Sx 
A 



OPT 



by |Lemma 4| 



Lemma 5 



Lemma 6 



and 



Lemma 7 



with £ = 5000 



We then compute a lower bound for each E[^ | £ a .\ (using 
and £' = 100000), and substitute them in the last inequality above to obtain a lower bound on E[RSOP] 
We have confirmed that E[RSOP] > \ OPT for all A € {11 • • • 5000} by numerically computing the lower 
bound for each choice of A in that range. The computed numerical values of our lower bound are listed in 
ITablc II for various choices of A. □ 



3.2 Large A 



We now prove the second part of Theorem 1 i.e 



Lemma 3 



Recall that E[RSOP] > E[^Z] OPT by 
thus E[^]E[Z] does not yield a lower bound on 



3^2 OPT for all A > 5000. 

Sa. 
x 



EJRSOP] > 

Also recall that % and Z are negatively correlated, 
a lower bound on &[^-Z]. Nevertheless, the correlation decreases as A 
increases which suggests that for sufficiently large A we can separate the two terms. In other words, when 
A is large (i.e., A > 5000), the two random variables ^ and Z are almost independent and so the expected 
value of their product is very close to the product of their expected values. Also for a large A the value of 
^ is very close to \ so E[yZ] is close to h ~E[Z]. We formalize this argument in the following lemma. 

Lemma 8. For any a <E [0, 1]; 



E 



> a E 



Pr 



£i A} ]) 



(18) 
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Proof. 



E 




= E 




-c{X}~ 


+ E 




f{X} 








J 









>aE 



*{x} 



because — - > a conditioned on £^ 
A 



(> 


z 


— E 


Z 


HA}" 


) 


(> 


z 


Pr 


f{X}~ 


) 


because Z < 1 by its definition, i.e., equation ^ 



□ 



Lemma 2 



Also observe that, for any 



Recall that we can compute an upper bound on Pr[fi^] using 
fixed a £ (0, 0.5), Pr [£i X ^] approaches exponentially fast as a function of A as A — » oo. The only remaining 
task is to compute a good lower bound on E [Z] . 



Lemma 9. E[Z] > 0.61. 

Proof. Let £ = 60000, m — 100, and ai — 0.5 + for each i E [m]. By applying 
X = 1 we set 



Lemma 4 



and plugging 



V[Z]>J2 



1 1 



\ \tti OL i+ i 
1 1 



> V I - - — ] Pr 

~{ \OLi Oi+i 



We then use 



to compute Pr[£iJ and use 



Lemma 6 



by [Lemma l| 
with £' — 100000 to compute a lower 



Lemma 7 

bound on Pr[£af +1 "°°*]; by substituting the computed numerical values in the above inequality we get 
E[Z] > 0.61. □ 

It is worth mentioning that by using a similar method we have computed an upper bound of E[Z] < 0.63 
which indicates that our analysis is almost tight. 



Proof of {Theorem l\ for large A (i.e., A > 5000/ Let a = 0.48. Then 



E [RSOP] > E 



A 



> a E 



OPT 



Pr 



OPT 



by |Lcmma 3| 
by |Lemma 8| 



Lemma 2 



we get Pr[fi A} ] < 0.0183 for all A > 5000; furthermore E[Z] > 0.61 by 



Lemma 9 



substi- 



Using 

tuting them in the above inequality we get E[RSOP] > 0.284 which is equivalent to a competitive ratio of 
3.52. □ 



4 The Exhaustive Search Lower-Bound 

In this section we propose an exhaustive search approach which yields an improved lower bound for RSOP 
for small choices of A (i.e., A < 10). The following theorem summarizes the main result of this section. 



Theorem 2. E[RSOP] > j|g OPT for A > 2 and E[RSOP] > \ OPT for A > 6. 

lower bounds obtained for various choices of X. 



Table 2 lists the actual 



3 Note that 1 — 1/e ~ 0.6321 which is slightly greater that the upper bound of E[Z] < 0.63. 
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The basic lower bound of E[RSOP] > E[^Z] OPT which we used in Section 3 does not yield a good 
enough bound when A is small, mainly because 

(I) ^ and Z are negatively correlated and their correlation is much stronger when A is small, also 



(II) the highest bid is always in B, so E[^] approaches \ as A goes down to 2. 



Table 1 



are 



Therefore, for A = 2, E[^Z] < E[^] E[Z] < 0.25 x 0.63 w ^ In fact, the lower bounds of 
quite close to the exact value of E[^Z], which suggests for small values of A we need a different approach 
We now provide a high level description of the approach of this section. Without loss of generality we 
assume OPT = 1. In addition to fixing the index of the optimal price, A, we fix the index of the second 
optimal price of a higher index, A', and also fix its corresponding revenue, OPT', i.e., 



A' G arg max J>A jv 
OPT' = max jvj 



(19) 
(20) 



We then try all possible values for OPT' and Vi,...,va (for an appropriate choice of d), and apply the 
techniques from |Section 3| to the remaining bids; however instead of fixing the exact value of each bid and 
of OPT' — which would require checking infinitely many instances — we restrict each bid and OPT' to an 
interval, i.e., OPT' € [ OPT ', OPT ] and Vj G [vj, Vj] for every j G [d]. We then try various configurations of 
such intervals to cover all possible scenarios. For each configuration we compute a lower bound on E[RSOP] 
for each A' < 5000 as a function of both A and A', and another lower bound as a function of only A assuming 
a reasonably large A' (e.g., A' > 5000). We then take the minimum lower bound among all configurations 
and all A' to obtain a global lower bound on E[RSOP] for each A G {2 • • ■ 10}; the computed lower bounds 
are listed in lSection 41 

Lemma 10. Let RSOPMinExpect(A, {[vj, Vj]}j^[d] , [ OPT '. OPT']) denote the minimum expected revenue 

o/E[RSOP] over all instances where OPT' € [ OPT ', OPT ], and Vj G [z; - , Uj- ] for all j G [d] and given that 
A is the index of the optimal price. Then for any 9, 9' G N, 



E [RSOP] > min 

(i 1 ,...,i d )e[ 

i'eW'] 



RSOPMinExpect A, 



1 1 



J 



ie[d] 



(21) 



Proof. The claim follows because the minimum is taken over all possible combinations of intervals and that 
any bid vector is covered by at least one of the combinations. □ 



Note that some combinations of intervals in (21) might be inconsistent/infeasible; for example it is 



infcasible to have both 



L 10 ' 10 J 



and [^3,^3] = [jt, A] because that would imply V3 > V2] we define 



RSOPMinExpect to be 00 if a configuration of intervals is infeasible. 



Computing a lower bound on RSOPMinExpect. In the rest of this section we show how to compute 
a lower bound on E[RSOP] given the assumption that OPT' G [0PT',0PT'] and Vj G {[Vj,Vj]} for all 
j G [d], where these intervals are specified exogenously. The high level idea is to enumerate all possible 
partitions of the first d bids, define an event for each such partition and decompose E[RSOP] over those 
events, and compute a lower bound conditioned on each such event. 

We start with a few definitions. For every T C [d], we define the following event 



A T = {An{l-d} = T} 



(22) 



Intuitively, At is the event that, among the first d bids, the subset of bids that fall in A is exactly T. 
Observe that under the event At, both Sj and Zj are constants (for every j G [d]); we will denote those 
constants respectively by 



3 



(S i |,4 T ) = |{i...j}nT| 

|{1-"J}\T| 



(Zj I At) = 



|{i...j}nT| 



(23) 
(24) 
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Our approach is to decompose E[RSOP] over the set of disjoint events {.At}tc{2— d} an d then decompose 



E[RSOP | At] further over the set of disjoint events {£i a 



{d+l---oc 



}ie[m] f° r some choice of 0.5 < ot\ < ■ ■ ■ < 



a m < 1 (the second decomposition is similar to Section 3 ) ; formally, 



E [RSOP] = ^ E [RSOP 

TC{2-d} i=l 



At n gj d+1 -°°y 



Next we show how to compute a lower bound on E[RSOP | At fl "a^l does not depend on 
the exact value of the bids. 



(25) 



Lemma 11. For any d > 2, any T C {2 • ■ ■ d}, and any 0.5 < a' < a < 1, 



E 



RSOP 



At n s}^ 1 ;-^ 

1 {a ,aj 



> E 



max(r T ^OPT')^ 



At n Sf d +\-°° } 



where r T and are constants defined as 



T T 

r = maxs., u. , 
je[d] J ~ r 



A T = {j £ T | ,sj^- > r T } , 



T 
Pa 



i/ r T > aOPT' 
izj j £ A T | , ^r") otherwise 



(26) 

(27) 
(28) 

(29) 



Proof. Let i>a a be the optimal price for A which RSOP offers to bidders in B; observe that 

RSOP > (A A - S Xa )v Xa = 'S'aa'^Aa^Aa 

Under event At H £(^a] we snow that S\ A v\ A > max ^r T , ^OPT'^ and Z\ A > pj, which combined 
with the above inequality imply the statement of the lemma. 

• S\ A v\ A > max ^r T , ^-OPT'^. Notice that S\ A v\ A is the optimal revenue of A which must be at 

least r T ; furthermore, the optimal revenue of A is no less than the revenue of selling to A at price vy 
which is at least %^ OPT '. 

• Z\ A > The inequality follows immediately by considering the following two possibilities: 

T T 

(I) A a < d. In this case A a must be in A , because for any j 6 {1 • • • d} \ A , selling to A at price 
Vj generates a revenue which is less than r T , therefore Vj cannot be the optimal price for A. 

(II) Aa > d. First we claim that this case cannot happen if r T > aOPT , because otherwise the 
revenue of selling to A at price v\ A is less than r T which contradicts its optimality. [^] 



If indeed Aa > d, then Z\ A > under event £} 



{d+l-.-co} 
(a' ,ct] 



□ 



Lemma 12. For any increasing sequence ct\, . . . ,a m £ (0.5,1) the following inequality holds (assume 

tttntl = V- 



E 



[RSOP]> £ E(^"^x) 

TC{2-d} i=l 



E 



max r 



^~OPt/) A T n£^ +1 -^ 



(30) 



6 The revenue of selling to A at price v\ A is -Saa^Aa wrucn i s a * most «A A t)A A = ctOPT' under event £^?~^ ■ 
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Proof. The claim follows by applying 



„„ p{d+l— 00} v c\° 



{d+l---oo} 



Lemma 11 



{c!+l--oo} 



to equation (25), then decomposing each event £} i Q . 



and applying Proposition 1 and then rearranging the terms. 



□ 



Next we sketch the proof of the main theorem of this section. 



Proof of \Theorem 2\ We use Lemma 10 with d = 11, 9 = 3 and 9' = 100 together with |Lcmma 12 with 



;1 for each i G [to]. To compute an accurate approximation (lower bound) 
%^opt') l ^n^ 1 '"" 1 ], we use a combination of dynamic programming 
and tail bounds similar to those of Lemma ~5{ |Lcmma~6 Lemma 7 Lemma 



to = 100 and a, = 0.5 -\ 
on each term E [max ^r T 



and Lemma 9 (observe 



that Sy is the only random variable in this term). However doing so naively requires computing a lower 
bound on as many as 9 d ~ 1 9'2 d ~ 1 



£, 



{d+l-oo} 



E [max 



] for all c, d G {§7,. 



t s^opt' ) I A T n 



to such terms]^] Instead, we pre-compute E[max(c, Syd) \ {Sd — a} n 
7}, all a G {0 • • • d}, and all a G {ai, . . . , a m }; and then we approximate 
with E[max(c, S\>d) \ {Sd = a} fl £i d+1 '"°°^] where c,d are the 



result of rounding r T and °^, T down to the nearest integer multi ples of -g r respectively and a = |T| and 

lists the lower bound obtained 



Table 2 



a = a,. r\ Notice that we only need to pre-compute (9' + l) 2 dm. 
for each A G {1 • • • 10}. As a last note, we should mention that we refine each configuration of intervals by 
cutting off infcasible regions of each interval prior to any further computational □ 



5 An Upper Bound on The Performance of RSOP 

It has been previously shown that there exist instances of bids for which E[RSOP] is as low as \ OPT (e.g., 



Feige et al. 2005 , Goldberg and Hartline 2001| ). However, all such instances have A = 2. That raises the 



question of whether the performance of RSOP approaches optimality asymptotically as A — > 00. In this 
section, we exhibit a family of instances for which E[RSOP] is no more than T^gg OPT as A — > 00, which 
proves that the asymptotic competitive ratio of RSOP is no better than 2.65. 

Theorem 3. For any A > 2 there exists an input instance where there are A bids above or equal to the 
optimal sale price and such that E[RSOP] < ^5 OPT. 

Next, we define a family of instances which are used in the proof of the above theorem. 

Definition 3 (Equal Revenue Instance). An instance of bids is called an equal revenue instance if choosing 
any of the bids as the sale price yields the same revenue. The equal revenue instance with n non-zero distinct 
bids is unique (up to scaling) and given by the bid vector q( n ) = (q^ 1 \q% , ■■■), where 

3 1 otherwise 

Proposition 2. For any equal revenue instance, RSOP offers the worst price to each of the sets A and B. 
In other words, the optimal price of each set generates the least revenue when offered to the opposite set (i.e., 
less revenue than offering any of the other non-zero bids as the sale price). 

Proof. It follows immediately from the fact that offering any of the bids as the sale price for both sets 
generates a total revenue that is equal to OPT. So the price that generates highest revenue for A also 
generates lowest revenue for B and vice versa. □ 



[Proposition 2| suggests that, for any given A, an equal revenue instance might actually be the worst case 
instance for RSOP among all instances with the same A; however based on computer simulation that seems 
not to be true at least for small values of A. 



7 Because there are 8 d 1 9' possible combinations of intervals in j2 1 1 and 2 d 1 events of the form At and m events of the 
farm 4f +1- "°° } - 

8 It is easy to see that E[max(c, %^c') | A T n £* d+1 " -oo} ] = E[max(c, %V) | {S d = |T|} n 4 d+1 '"°° } ]. 



9 For example if Vj < Vj+n we set v 



A' 

n . 
-3 
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To prove Theorem 3 we need to show the expected revenue of RSOP is no more that ^-gg OPT for any 
equal revenue instance with distinct bids. However a direct analysis of the performance of RSOP for all such 
instances is not easy. Instead we define a modified variant of RSOP whose performance is easy to analyze, 
and whose revenue is close to the revenue of RSOP (e.g., asymptotically equal as A — > oo). 

Definition 4 (RSOP*). The modified random sampling optimal price auction behaves exactly the same way 
as RSOP (see Definition 1), except if all of the non-zero bids fall in the same set, it offers them the lowest 
non-zero bid as the sale price ( instead of 0) . 

Note that RSOP* is not a truthful auction, however it is only used to aid the analysis. Next we show 
that the revenue of RSOP is asymptotically equal to the revenue of RSOP*. 

Lemma 13. E[RSOP] < E[RSOP*] < E[RSOP] + H)^ 1 OPT, with the second inequality being met with 
equality for equal revenue instances. 

Proof. Recall that RSOP* behaves exactly like RSOP except when all the n bids fall in the same set which 
happens with probability (g)™ , in which case RSOP* still generates a revenue of at most OPT (exactly 
OPT if it is an equal revenue instance), while RSOP generates zero revenue. □ 

Lemma 14. E[RSOP*(qW)] is a decreasing function of n. 

Proof. Let Rcv(v, A,p) and Rcv(v, B,p) denote the revenue obtained by offering price p to bidders respec- 
tively in A and B with the vector of bids v. Also let Rev*(v, A) and Rev*(v, B) denote the revenue RSOP* 
obtains respectively from each of A and B under partition (A,B). Observe that Rev*(v, A) = 0, because 
the price that is offered to A is always 1. So it is enough to show that E[Rev* (q*-"-*, B)] is a decreasing 
function of n. 

Let v = q(") and v' = q(" -1 ). We now prove that Rev*(v,B) < Rev*(v',B) which implies the claim 
of the lemma. Let v\ A and v' y denote the prices offered to B by RSOP* respectively on v and v', i.e., 
Aa G arg max eA Rev(v, A,Vj), and A A e arg max j6A Rev(v', A, v'j). There are four possible scenarios: 

(I) n e A and {1 • • • n ~ 1} C B. In this case Rev*(v, B) = 1- A<1 = Rev*(v', B)[^] 

(II) n € A and {1 • • • n — 1} <£. B. In this case either 

(a) Aa = A A < n and so Rev*(v,B) = Rev*(v',B), or 

(b) Aa < A A = n, but that means Rev(v, A, v\ A ) > Rev(v',A,v A , ), therefore it must be Rev* (v, B) = 
Rev(v,B,w AA ) <Rev(v',B,u^) = Rev*(v',B).F] 

(III) n € B and {1 • • • n - 1} C B. In this case Rev*(v, B) = Rev*(v', B) = 1. 

(IV) n £ B and {1 • • • n — 1} $L B. In this case Aa = A A < n and v\ A > v n so v n does not affect the 
revenue, therefore Rev*(v, B) = Rev*(v',B). 

□ 

The following is obtained by direct calculation using a computer. 
Proposition 3. E[RSOP*(q( 400 ))] = 0.377208 ± lO" 6 . 
We now prove the main theorem of this section. 



10 Recall that OPT(q< n )) = 1. 

11 That is because both v and v' are equal revenue instances, therefore Rev(v, B, v\ ) + Rev(v, A, v\. ) = Rev(v', B, v' , ) - 

A 

Rev(v', A, v' x , ) = 1. 
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Proof of Theorem 3\ To prove the theorem for any A we exhibit a bid vector v with A bids above the optimal 
sale price such that E[RSOP(v)] < ^ OPT(v). Let n = max(A,400), then 



E 



RSOP(q (n) ) 



< E [RSOP*(q (n) 

< E [RSOP*(q (400) ) 

< 0.377209 

< — OPT(qW) 
2.65 yH ' 



by |Lemma 13 

by |Lemma 14| 
by [Proposition 3| 
because OPT(q (n) ) 



Observe that the optimal sale price for q( n ' is not unique. Let v be the same as q*-™-* everywhere except 



= q^' + e for a small e G (0, w). Observe that v\ is now the unique optimal sale price for v. It is easy to 
see that lim e ^ E[RSOP(v)] = E[RSOP(q( n ')] and lim e ^ OPT(v) = OPT(q(")) = 1, so for a small enough 
e, we get E[RSOP(v)] < OPT(v) which completes the proof. □ 

6 A Combinatorial Lower Bound 

In this section we present a combinatorial approach for obtaining a lower bound on the expected revenue 
of RSOP for equal revenue instance where each non-zero bid is either 1, or ft, (for some fixed h € N). We 
hope the ideas we present in the section help develop a more general combinatorial approach in the future 
for proving lower bounds on mechanisms based on random sampling. 

Observe that in an equal revenue instance where non-zero bids are either h or 1, if there are k bids of 
value h, there must be k(h — 1) bids of vale 1. Throughout the rest of this section we assume h is an implicit 
constant. The following theorem summarizes the main result of this section. 



Theorem 4. For any equal revenue instance where each non-zero bid is either h or 1, 

'1 1 1 
2 + 2h ~ 2 kh ~ 1 



E[RSOP] > ( I + J--—L- )OPT C-!1i 



where k is the number of bids of value h. 

Observe that in the above theorem the worst case of the lower bound is when k = 1 and h = 2 for which 
the lower bound becomes OPT /4. Notice that the lower bound approaches OPT /2 quickly as either k or h 
increases. 

Definition 5. denotes the multi-set of bid corresponding to an equal revenue instance with k bids of 

value h and k{h — 1) bids of value 1. 

For the rest of this section we assume that A and B are multi-sets containing the actual bids in each side 
of the partition, as opposed to the previous sections where we assumed A and B contained the indices of 
those bids. Furthermore, for any multi-set of bids such as I, we use the notation E [RSOP (I)], E [RSOP* (I)] 
and OPT(I) to denote the respective quantity being computed on bids explicitly specified by I. We also 
make no assumption about which of A or B gets the highest bid, unless explicitly stated otherwise. 



We start by proving a lower bound on the expected revenue of RSOP* (see Definition 4 1 on equal revenue 
instances where each non-zero bid is either h or 1. We then extend the lower bound to RSOP. Recall that 
RSOP* behaves exactly the same way as RSOP, except if all non-zero bids fall in the same set, RSOP* offers 
them the lowest non-zero bid as the sale price (instead of 0) . 



Lemma 15. For any k e N, 

E 



RSOP*(Q^)] > M^±!2 = I OPT(Q< fe )) + k -. (32) 
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Proof. We prove the claim by induction on k. 

We first prove the base case which is k = 1. The single bid of value h is the highest bid. Without loss of 
generality assume that the h bid is in B. Observe that the optimal price of B is h which is also the price 
offered to A, so no revenue is obtained from A. Furthermore the optimal price of A is 1 which is also the 



1 + 



h-l 



which 



price offered to B. Each bid of value 1 falls in B with probability 1/2, so E[RSOP*(Q (1) )] 
proves the base of the induction. 

We now prove the induction step. For any two multi-sets of bids such as T and U, let Rev* (T, U) denote 
the revenue obtained from T by computing the optimal sale price for U (let the optimal price be 1 if U = 0) 
and offering that price to T; also let Rev(T,p) denote the revenue obtained by offering price p to T. Let 
(A,B) be a random partition of Q*- 1 ^, and let (A',B') be a random partition of Q^ 8-1 ). Observe that 
(A U A', B U B') is a random partition of Q^ k \ The induction step follows from the following inequities. 



E 



RSOP*(Q (fc) ) = E [Rev* (A U A', B U B') + Rev*(B UB',AU A')] 

> E [Rev* (A, B) + Rev* (A', B') + Rev*(B, A) + Rev*(B', A')] to be proven 



E 



RSOP*(Q 



+ E 



> 



RSOP*(Q (fc " 1) ) 
h + 1 (k-l)(h+l) k(h+l) 



> 



by the induction hypothesis 



We shall prove Rev* (BUB',AUA')> Rev* (B, A) + Rev* (B', A') and by symmetry we can argue Rev* (A U 
A', BUB') > Rev* (A, B) + Rev* (A', B') which completes the proof. Let p, p' and p" denote the optimal 
price of A, A' and A U A' respectively as computed by RSOP* (i.e., the optimal price for an empty set 
would be 1). We argue that 



Rev* (BUB', A U A') = Rev(B,p") + Rev(B',p") 
> Rev(B,p) +Rev(B',p') 
= Rev*(B,A) +Rev*(B',A'). 



explained below 



Observe that both A U B and A' U B' are equal revenue instances and by Proposition 2 in any equal 
revenue instance the price that is optimal for one side generates the least revenue for the opposite side so 
Rev(B,p") > Rev(B,p) and Rev(B',p") > Rev(B',p'). 

□ 



Proof of \ Theorem J\ Recall that the only situation where RSOP* and RSOP behave differently is when 
either A or B is empty which happens with probability l/2 kh ~ 1 , therefore 



E 



RSOP(Q 



(fc)x 



E 



RSOP*(Q 



(fc)^ 



> 



k(h + l) 
2 

1 1 

2 + 2h 



1 



1 



2kh-l 
1 



2 k 



kh-l 



OPT 
OPT 



OPT 



by |Lemma 15 
because OPT = kh 



That completes the proof. 



□ 
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A Results 



A 


E[RSOP]/OPT 


Competitive- Ratio 


2 


0.125148 


7.99 


3 


0.166930 


5.99 


4 


0.192439 


5.20 


5 


0.209222 


4.78 


6 


0.221407 


4.52 


7 


0.230605 


4.34 


8 


0.237862 


4.20 


9 


0.243764 


4.10 


10 


0.248647 


4.02 


11 


0.252774 


3.96 


15 


0.264398 


3.78 


20 


0.273005 


3.66 


30 


0.282297 


3.54 


50 


0.290384 


3.44 


100 


0.296993 


3.37 


200 


0.300549 


3.33 


300 


0.301784 


3.31 


500 


0.302792 


3.30 


1000 


0.303560 


3.29 


1500 


0.303818 


3.29 


2000 


0.303949 


3.29 



Table 1: Computed numerical values for the basic lower-bound of E[RSOP] > E[^Z] OPT. 



A 


E[RSOP]/OPT 


Competitive-Ratio 


2 


0.2138 


4.68 


3 


0.2178 


4.59 


4 


0.238 


4.20 


5 


0.243 


4.11 


6 


0.2503 


3.99 


7 


0.2545 


3.93 


8 


0.2602 


3.84 


9 


0.2627 


3.81 


10 


0.2669 


3.75 



Table 2: Computed numerical values for the exhaustive-search lower-bound 



B Proofs 



Theorem 5 (Chcrnoff-Hocffding 1963 1. For (i.i.d.) random variables X\, X2, Xg G {0,1} with'E[Xi\ = p, 
the following inequality holds for all e G (0, 1 — p): 



Pr 



Lemma pi. For any a G (0, 1) and j G N, 



< 



p + e 



if a > 0.5, then Pr 
if a < 0.5 — then Pr 



4 j} 



> 1 - (r a ) 3 , 



< r 



(«+i/j)J 



1-P 

1 — p — e 



1— p— e\ 



where 



1 



2a Q (l-a) 1 -« 
where r a is the same as above. 



(33) 



Proof. Let Aj be an indicator random variable which is 1 if j G A, and otherwise. 

The first inequality of the lemma follows immediately from Theorem 5 by setting Xj = Aj, I = j, p = 0.5, 

and e — a — 0.5 which yields an upper bound on Pr[£^] and thus a lower bound on Pr[Ea^]- Note that 



Ai — with probability 1, however that only decreases the probability on the left hand side of (33) so it 
still holds. 
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To prove the second inequality, we proceed as follows. 



Pr 



Pr 



< a 



Pr 



Pr 



< Pr 



> 1 - a 



3 



3 3 

i - 1 3 



because A± = 1 always. 



The second inequality of the lemma now follows immediately from 
p = 0.5, and e = 0.5 — a — i. Note that j\_ „_i = r ,i . 



Theorem 5 



by setting Xj = ^j-i, £ = j — 1) 

□ 



Theorem 6 (Fortuin, Kasteleyn, and Ginibre 1971 ). Lei L be a finite distributive lattice, and [i : L — > R + 



be a function that satisfies 

H{x A y)n{x V y) > (j,(x)fi(y), 



for all x, y G L. 



(34) 



TTien /or any iioo functions f, g : L — ► K + which are either both increasing, or both decreasing, the following 
inequality holds. 



(35) 



Lemma M. For any T, T' C N and a G [0,1], i/ie two events £^ and £^ are positively correlated, i.e., 
Pr[CnC]>Pr[C]Pr[C]- 

Proof. For every n £ N, define T n = T n {1 • • • n}; similarly define T^, A n , B n , etc. 

We start by proving Pr[£j™ n £ a n ] > Pr[£j B ] Pr[£j n ] for every n £ N. Let L„ be a distributive 
lattice whose elements are the subsets of {2 • • • n} and whose meet /join operators correspond to taking 
intersection/union. For all A £ L„ let //(A) = 1/2™ -1 . Define £j n (A) to be an indicator function which is 
defined for each A £ L„ as 



£ a T "(A) 



1 if |A n {1 • • • j}\ < aj for all j £ T„ 
otherwise 



By invoking 



Theorem 6 



on lattice L n and substituting f(x) and g(x) with £ J"(A) and £ Q "(A) respectively 



we get the following inequality. 



v C"(A)gJ" (A) 

/ j On— 1 




£«"(A) 



V AC{2- n} / \AC{2-n} / \AC{2~n} 

Observe that the left hand side of the above inequality is exactly Ea[£„ " (A)£q " (A)] = Pr[£j™ n £ J"] 

while its right hand side is exactly Ea[£ J" (A)] Ea[£o 71 (A)] = Pr [£j n ] Pr[£ J"], so we have proved that 

Pr[£-J" n fa"] > Pr[£j"] Pr[£j"] for every n G N. 

We now prove the infinite case. For every n G N, define = Pr[fJ" n £a n ], r n — Pr[£j™] Pr[fJ™], 
and d„ = l n — r n . Observe that d n is an infinite sequence which is bounded in [0,1], so by invoking 
Bolzano Weierstrass theorem we argue that it has an infinite converging subsequence, i.e., there exists an 
infinite sequence of indices n% < 712 < ••■ and d* G [0, 1] such that linij_ i . 00 d n . — d* . On the other hand 
both l n . and r n . are decreasing sequences which are bounded below by so they both converge, therefore 



Pr 



el n £? 



Pr 



e i 



Pr 



T' 



lim l T< 



lim r n 



lim d r . 



d* > 



which proves the claim of the lemma. 



□ 
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